Nan Duan

AdaCodec: A Predictive Visual Code for Video MLLMs

Jun 01, 2026

Embodied3DBench: Benchmarking Low-Level Embodied Spatial Intelligence of Vision Language Models

May 27, 2026

OmniNFT: Modality-wise Omni Diffusion Reinforcement for Joint Audio-Video Generation

May 12, 2026

Thinking with Novel Views: A Systematic Analysis of Generative-Augmented Spatial Intelligence

May 11, 2026

Awaking Spatial Intelligence in Unified Multimodal Understanding and Generation

May 05, 2026

A Systematic Post-Train Framework for Video Generation

Apr 28, 2026

JoyAI-RA 0.1: A Foundation Model for Robotic Autonomy

Apr 22, 2026

Near-Future Policy Optimization

Apr 22, 2026

OpenSpatial: A Principled Data Engine for Empowering Spatial Intelligence

Apr 09, 2026

SpatialEdit: Benchmarking Fine-Grained Image Spatial Editing

Apr 06, 2026